Strong localization of positive charge in DNA 
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Microscopic mechanisms of positive charge transfer in DNA remain unclear. A quantum state of 
electron hole in DNA is determined by the competition of a pi-stacking interaction b smearing a 
charge between different base pairs and interaction A with the local environment which attempts to 
trap charge. To determine which interaction dominates we investigated charge quantum states in 
various (GC)n sequences choosing DNA parameters satisfying experimental data for the balance of 
charge transfer rates <-> Gt, n = 2,3 [ij]. We show that experimental data can be consistent 
with theory only under an assumption 6 <^ A meaning that charge is typically localized within a 
single G site. Consequently any DNA sequence including the one consisting of identical base pairs 
behaves more like an insulating material than a molecular conductor. 

PACS numbers: 7080.Le, 72.20.Ec, 72.25.-b, 87.14.Gg 
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1. Positive charge transfer in DNA is being exten- 
sively investigated since its experimental discovery 
while to our knowledge there is no experimental demon- 
stration of the negative charge transfer. Electron transfer 
in DNA can be responsible for the oxidative DNA dam- 
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H, Q and is possibly important for DNA 
Also an ability of DNA to promote long 
distant charge transfer can be used in molecular electron- 
ics Q . Therefore this problem is extensively investigated 
during the past decade but the microscopic mechanism of 
charge transfer is not understood yet. We hope that this 
work may shed some light on this challenging problem. 

DNA contains two different sorts of base pairs AT and 
GC forming quasi-random sequences. The lowest ion- 
ization pot ential is attributed to a GC pair (essentially 
G-base [10|). Since the electron transfer integral b be- 
tween adjacent bases does not exceed AT - GC ionization 
potential difference 6 < A ^ 0.5eV, the quantum state 
of charge in a frozen environment will be localized near 
some C base and its localization length is comparable 
to the interbase distance[ll]. Interaction with the envi- 
ronment breaks down this localization, inducing charge 
hopping between adjacent quantum states localized at G 
sites. Indeed, according to experimental studies 0| 
and theoretical model the sequence dependent 

charge transfer in DNA can be represented as the series 
of charge hops between adjacent G bases serving as cen- 
ters of localized states. An addition of AT pair between 
adjacent GC pairs dramatically reduces the charge hop- 
ping rate JJ5] and therefore optimum base sequence 
for the most efficient charge transfer consists of identical 
base pairs. Therefore we are going to study a charge 
(hole) quantum state in sequences of identical GC base 
pairs. 

The thermal energy at room temperature is very small 
compared to other characteristic energies of the problem 
so we can use the ground state of the hole coupled to en- 
vironment as a representative state. Then the spatial size 
of the hole state is determined by the competition of the 



delocalization of charge due to the pi-stacking interaction 
of heterocyclic groups belonging to adjacent bases and 
the localization caused by the environment polarization 
around the charge. The charge delocalization energy can 
be characterized by the effective electron transfer integral 
b and the localization energy is g iven by the medium re- 
organization energy A [il [il [M [13, [11 . Delocahzation 
of charge over k base pairs leads to the gain in the energy 
Edei ^ — 2fe + 6/fc^, while the reorganization energy scales 
with the size of charge wavefunction as Eioc ~ — A/fc (see 
e. g. [l3|)- Optimization of the total energy with respect 
to the number of sites k estimates the size of the ground 
state charge wavefunction k 2b/ X. At zero temper- 
ature the hole is localized, while at finite temperature 
it can hop to different states because of its interaction 
with the fluctuating environment. In the translationally 
invariant system ((GC)„ or (AT)„) the potential barrier 
separating two configurations can be estimated as the en- 
ergy price for increasing the size of the wavefunction by 
one more site k —^ k-\-l compared to its optimum state. If 
6 ^ A then this energy is given by the half of the reorga- 
nization energy A « A/2 ^ ksT, while in the opposite 
limit we get a very small value A ~ A'*/(166^) which 
quickly becomes negligible at moderately large b/X. 

It is important to determine the true relationship of 
b and A in DNA. Indeed, in the regime 6 ^ A a DNA 
molecule made of identical base pairs would behave as 
a one-dimensional conductor, while in the opposite limit 
DNA always acts like an insulator. Existing estimates in 
literature 0, [11, [H, [H, [13, [3 [Hi do not help much in 
resolving this issue, because there is a large controversy 
between different approximations. In particular, various 
estimates for the electron transfer integral b range from 
0.05eV[l3| to 0.5eV [l^, and all calculations of b ignore 
vibrational rearrangements [2l) . The estimates for the 
reorganization energy A using the continuous medium 



approach range from 0.25eV[18j to more than leV 
due to uncertainty in the water dielectric constant value 
near the DNA molecule. Therefore based on different ap- 
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proaches different conclusions are made about the charge 
state inchiding a prediction of a propagating intermedi- 



ate size polaron 



preaiction ( 
m H [11 



or a small radius polaron 



essentially localized within a single base[l7|. 

To resolve this problem we suggest an alternative 
method to study the charge quantum state within the 
DNA molecule, using experimental data sensitive to the 
relationship of two key parameters of the theory, b and A. 
Namely, we exploit the rate constants for the balance of 
charge transfer rates between different (G'C)„ complexes, 
measured by Lewis and coworkers [l[ 
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In the thermal equilibrium these ratios are determined 
by the base pair partition functions 



r2 



fcGG 



f^GGG 
f^GGG 



Z2+Z1 
Z2Z1+ ' 
Z3+Z1 

ZzZ\j^ 



(2) 



where Z„_|_ stands for the partition function for G„ se- 
quence containing a single hole in it, while Z„ is the par- 
tition function for the same base sequence, but without 
the hole. Since G and G„ complexes are separated by 
some AT bridge we can ignore their interactions. 

Both ratios in Eq. ([1]) depend on two parameters 
h and A and the thermal energy at room temperature 
fc^T ^ 0.026ey. Below we calculate both ratios using 
tight binding model for G„ complexes and standard lin- 
ear response theory for charge interaction with the en- 
vironment [isl . [20| . Theory determines the domain of 
parameters A and h satisfying experimental data Eq. ([T]). 
We demonstrate, that any choice of A and b, satisfying 
Eq. corresponds to the regime b <C A, where the hole 
in its ground state is localized essentially in a single G 
base (see Fig. Qp. 

2. The chain of n GC base pairs can be described 
by the tight binding Hamiltonian coupled to the classical 
environment represented by coordinates Xi, i = l,...n 
coordinates for each DNA site 

H — Hfiole + Hmedia + Vint , 
n-1 

Hhole = -6^(c+Ci+i + C^iC,;), 
i=l 

^ n n 

-^Xf, V-„, = -^X,c+Q. (3) 
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Here Ci , are operators of creation and annihilation of 
electron hole in a site i. Classical coordinates Xi de- 
scribing the polar environment are directly coupled to 



the local charge density = cfci. The solvent energy is 
expressed as a bilinear form with respect to solvent coor- 
dinates, which is justified by a standard assumption that 
polarization fields are small compared to atomic fields [20j 
so we can ignore X^ terms. We assume that only classi- 
cal degrees of freedom with excitation energy comparable 
or less than the thermal energy are left in Eq. ([3]), while 
high energy modes are integrated out. This may lead to 
the renormalization of parameters in the system Hamilto- 
nian Eq. ([3]) (see e. g. 21, 22|) and we assume that this 
renormalization is made. We do not include off-diagonal 
terms X^Xj, i ^ j into the Hamiltonian. This is justified 
because they are smaller than the diagonal ones 16]. It 
can be shown that for G2 sequence the problem including 
off-diagonal terms can be reduced to the diagonal model 
Eq. ([3]) with the replacement of the single site reorgani- 
zation energy A with the reorganization energy for charge 
transfer between adjacent sites. For GGG sequence the 
similar replacement with removal off-diagonal terms re- 
mains a good approximation [2^. Note that the addition 
of A or T bases surrounding G„ sequences leads to small 
changes in our results because the electron transfer in- 
tegral is smaller in average between A and G base then 
between two G bases and also because of the large ion- 
ization potential difference of A and G bases . Also 
we ignore the second Cn strand because of the weak cou- 
pling between strands [iTj and large difference of their 
ionization potentials flo'|. 

We study the ratios of charge transfer rates Eq. 
([2|). Each partition function is given by Z„ — 
j dXi...dXnTre~^^^ , where Hn is G„ Hamiltonian Eq. 
([3]), trace is taken only over the states with the single 
hole {Zn+) or no holes {Zn) and (3 — 1/(A;bT). If there 
is no hole calculations are reduced to multiple evaluation 
of a Gaussian integral leading to 
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For the sequences containing a hole an analytical expres- 
sion can be obtained only for n = 1 



ry fi\/2 

Zi+ = ceJ ' . 



(5) 



For n = 2, 3 one can perform analytical integration over a 
"center of mass" coordinate Xi^ ..Xn which is coupled to 
the conserving operator of the total number of particles 
Cj^ci + .. + c+c„ = 1. This reduces the calculations to 
single and double integrals, respectively. Below we give 
the expression for (GG)+ partition function 



= {V2cy 



due 



cosh \P\l^+h-^\ ,(6) 



while the expression for Z3+ is more complicated and will 
be published elsewhere [23j. 
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3. We have performed numerical evaluations of ratios 
in Eq. ([2]) to find domains of parameters b and A satis- 
fying Eq. ^ and show these domains in Fig. [TJ The 
upper (lower) border of each domain is defined by the 
maximum (minimum) value of ratios r2 and Eq. Q 
within the experimental error (8.7 (6.7) for GG and 21 
(19) for GGG). The acceptable domain of parameters 
for the GGG sequence fully belongs to the correspond- 
ing domain for the GG sequence. Thus the domains for 
GG and GGG base sequences are completely consistent 
with each other. Therefore we cannot determine param- 
eters A and b better then using the "dark" domain for 
GGG. This information is still sufficient to consider the 
localization of the hole wavefunction in G„ aggregates. 

Since the thermal energy ksT w 0.026eV is smaller 
than other characteristic energies of the system (remem- 
ber that the minimum estimate for the reorganization 
energy is A ~ 0.25eV [ISj]) we can characterize the wave- 
function using the system ground state at coordinates Xi 
minimizing the ground state energy. In the relevant do- 
main of parameters in Fig. [T](A > 0.25) the ground state 
wavefunction is centered at one of G bases (left or right 
ones for the GG sequence and the central one for a GGG 
sequence). We characterize this state by the probability 
Pq for the particle in the ground state to be in this cen- 
tral site. One can show that Pq ~ Xi/X, where i is the 
central site;23]. 

For instance, for the GG sequence the expression for 
the hole ground state energy at arbitrary coordinates Xi , 
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FIG. 1: The domains consistent with the experimental ra- 
tios of reaction rates Eq. dark grey for GGG and light 
grey for GG. Inset shows the fraction of the quantum charge 
state belonging to the central site (Pq) vs. the reorganization 
energy A. 



X, reads E, = - ^ - i,, 

the regime of interest 26 < A (see Fig. [T]) the minimum 
of energy is given by 

E2m^n = -A/2 - / \ (7) 

and it is reahzed at Xi = A/2 ± ^(A/2)2 - foz ^ A - 
X2. Accordingly Pq = ^ = A+y|IEi5!. Note that if 
2A < & the ground state wavefunction is symmetric in 
the minimum Xi = X2 = A/2 and the energy of this 
state is given by 

E2symm = —A/4 — 6. (8) 

In the case of 26 < A this symmetric state is the transition 
state (saddle point in the energy function E2{Xi,X2)) 
between the energy minima centered in the first and the 
second G"s. 

For the GGG complex the probability to reside in the 
central site was evaluated numerically. Both probabili- 
ties Pq obtained for the X - b line corresponding to the 
ratio r2 = 7.7 Eq. ^ are shown in the inset in Fig. [T] 
It is clear from this graph that for both GG and GGG 
sequences the wavefunction of the hole is essentially lo- 
calized in the single G site. For instance at the minimum 
value of A 0.25eV we have 85% and 78% of the prob- 
ability to find the particle in that site for GG and GGG 
sequences, respectively. These probabilities increase with 
increasing of the reorganization energy to leV up to 96% 
and 94%, respectively. 

Thus we come to the important conclusion that wave- 
functions of hole are essentially localized in the single G 
site for Gn sequences. This conclusion differs from the 
predictions of previous work 0, [l^, [l^ where the polaron 
of an intermediate range was used to describe the quan- 
tum state of the hole. Our conclusion is justified by the 
agreement with the experiment [l|. 

4. An impressive consistency between GG and GGG 
in Fig. [T] is not accidental and can be explained by 
the strong localization of charge wavcfunctions. In the 
regime of strong localization the partition function Zn+ 
for n > 2 consists of n contributions of energy minima 
corresponding to wavcfunctions centered in all n G sites 
with coordinates X realizing the corresponding energy 
minimum w A 3> Xk, k ^ i for the state centered at 
site i. Since in the zero order approximation in 6/ A each 
quantum state is localized at one site one can neglect the 
difference in preexponential factors for the case of 6 = 
and we can approximate the partial contribution to 
the partition function as = c"e~^^* (see Eq. (j4])), 
where Ei is the energy of the ground state for coordi- 
nates X realizing the local minimum. First order cor- 
rection to the energy p'"' = —A/2 is important because 
it is in the exponent and multiplied by the large factor 
(3. For the two states at the edges this correction can be 
expressed as Ei = En « —A/2 — 6^ /A, which coincides 
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with the ground state energy for GG Eq. ([7]) . This is not 
surprising because the contributions of non-neighboring 
sites is neghgible due to the strong locahzation of charge. 
For n — 2 remaining states the correction to the energy 
should be doubled because of the addition of contribu- 
tions of two neighbors so we got iS^^'' — —A/2 — 
1 < i < n. Consequently, we can approximate the ratio 
r„ (cf. Eq. (O) as 

r„ = Z„+Zi/(Z„Zi+) « 2e^^'/^ + {n - 2)e'^'''/\ (9) 

Particularly, one can show that (^3+^1/(^3^1+) « 
{Z2+Zi/{Z2Zi+) + ((Z2+Zi/(2Z2Zi+))2) and this rela- 
tionship is satisfied for the experimental values of ratios 
within the accuracy of the experiment. This explains the 
consistency of domains for GG and GGG Fig. [TJ Using 
Eq. ^ one can predict that ratios r„ form arithmetic 
series. Particularly for the balance between G and G4 
sequence we predict the ratio = 2r3 — r2 = 32.3. This 
estimate agrees with our numerical calculations for the 
G^ partition function. One should notice that in case 
of delocalization b > A/2 the difference in energy be- 
tween G'^ and is 6 > 0.15eV and one would expect 

b 

r2 > e''B'^ > 300 which is not the case in the experiment. 

5. Thus we considered the quantum state of the pos- 
itive charge (hole) in poly-G - poly-G base sequence. It 
turns out that the agreement with the experimental data 
for the ratios r„ Eq. ([2]) for n = 2, 3 can be attained only 
assuming the strong localization of charge within almost 
a single G-base. The charge in DNA then behaves as a 
small polaron with the size less than the interbase dis- 
tance. Based on our theory we predict all other ratios 
r„ = 7.7 + 12.3 • (n — 2). Notwithstanding this prediction 
we are not able to identify more accurately the electron 
transfer integral b and the reorganization energy A using 
experimental data only for the ratios. 

Based on our theory we can suggest to find these pa- 
rameters measuring the temperature dependence of the 
charge transfer rate through poly-G - poly-G base se- 
quence. We expect that this temperature dependence 
will be described by the Arrhenius law with the activa- 
tion energy defined by the difference of charge symmetric 
transition state energy within (GG)"*" base pair Eq. ([8]) 
and the charge ground state energy for (GG)"*" state Eq. 
EA^X/^-b + byX. 
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